ABSTRACT

An algorithm for the pole assignment problem has recently been given in
[Kautsky, Nichols, Van Dooren]. Given the $n$ by $n$ matrix $A$, the $n$ by $m$ full rank
matrix $B$ ($m<n$), and the $n$ complex numbers $\lambda_i$, the problem is to find $F$ such
that $A+BF$ has the $\lambda_i$ as eigenvalues. The quality of the solution is measured by
the condition number of $X$, the matrix of eigenvectors of $A+BF$. In this note we
show that a lower bound on Cond(X) is given essentially by the reciprocal of the
product of three terms: the condition number of $B$, the maximum condition
number of any $A-\lambda_i I$, and the relative distance from the pair $(A,B)$ to the
nearest uncontrollable pair. This last term can be interpreted as follows: the
lower bound on Cond(X) becomes large as the relative distance from the
problem to the nearest "infinitely ill-conditioned" problem becomes small.


1.   Introduction and Summary

The pole assignment problem is defined as follows: given real matrices $A$ and $B$ of
dimensions $n$ by $n$ and $n$ by $m$, respectively, $B$ of full rank, and a set $\{\lambda_i\}$ of $n$ complex
numbers closed under complex conjugation, find a real $m$ by $n$ matrix $F$ such that $A+BF$
has the $\lambda_i$ as eigenvalues. It is well known that this problem has a solution for all possible
self-conjugate $\{\lambda_i\}$ if and only if the pair $(A,B)$ is controllable, i.e.
$[B \mid AB \mid A^2B \mid \cdots \mid A^{n-1}B]$ has full rank $n$ [Wonham]. If the pair $(A,B)$ is not
controllable, then some eigenvalues of $A+BF$ will be independent of $F$ (and be
eigenvalues of $A$). In either case, the column space of $[B \mid AB \mid A^2B \mid \cdots \mid A^{n-1}B]$ is
called the controllable subspace of the pair $(A,B)$. The robust pole assignment problem, as
defined in [Kautsky, Nichols, Van Dooren], is to find $F$ satisfying the basic problem above
such that $X$, the eigenvector matrix of $A+BF$, is as well conditioned as possible. The
condition number of $X$, Cond(X) $= \|X\| \cdot \|X^{-1}\|$ ($\|\cdot\|$ denotes the 2-norm), turns out
to measure the size and sensitivity of both $F$ and the time dependent solution of the
control system defined by $A$ and $B$ (see [Kautsky, Nichols, Van Dooren] for details).
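The rank test for controllability stated above can be tried on small examples. The following is a minimal sketch, not part of the original algorithm: it assumes matrices with exact (integer or rational) entries stored as lists of rows, and the helper names `matmul`, `rank`, `controllability_matrix` and `is_controllable` are hypothetical. For floating point data the rank decision is delicate, which is one reason to measure a distance to the nearest uncontrollable pair instead.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def rank(M):
    """Rank by Gaussian elimination over exact rationals."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for c in range(ncols):
        # Find a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][c] / rows[r][c]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def controllability_matrix(A, B):
    """Return [B | AB | ... | A^(n-1) B] as a list of rows."""
    n = len(A)
    block, blocks = B, []
    for _ in range(n):
        blocks.append(block)
        block = matmul(A, block)
    return [sum((blk[i] for blk in blocks), []) for i in range(n)]


def is_controllable(A, B):
    """True when the controllability matrix has full rank n."""
    return rank(controllability_matrix(A, B)) == len(A)


A = [[0, 1], [0, 0]]
print(is_controllable(A, [[0], [1]]))  # input enters the second state
print(is_controllable(A, [[1], [0]]))  # input enters the first state only
```

In the first call $[B \mid AB]$ is a permutation matrix, so the pair is controllable; in the second, $AB = 0$ and the rank drops to 1.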

In this paper we consider lower bounds for Cond(X). In particular we show that a
lower bound on Cond(X) is given essentially by the reciprocal of the product of three
quantities: the condition number of $B$, the maximum condition number of any $A-\lambda_i I$,
and the relative distance from the pair $(A,B)$ to the nearest uncontrollable pair.

Theorem: Let

$$ \kappa_B = \frac{\|B\|_F}{\sigma_{\min}(B)} , $$

$$ \kappa_A = \max\left( 1 , \; \max_i \frac{\|A\|_F}{\sigma_{\min}(A-\lambda_i I)} \right) , $$

and

$$ D_u = \frac{\|A_u-A\|_F}{\|A\|_F} + \frac{\|B_u-B\|_F}{\|B\|_F} , $$

where $(A_u,B_u)$ is the uncontrollable pair which minimizes $D_u$. Then any solution $X$ to the
robust pole assignment problem satisfies

$$ \mathrm{Cond}(X) \ge \frac{9-\sqrt{77}}{2\sqrt{n}\, D_u \kappa_B \kappa_A} > \frac{1}{9\sqrt{n}\, D_u \kappa_B \kappa_A} . $$
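To get a feel for the size of the bound, the following sketch evaluates both forms of the lower bound for given values of $n$, $D_u$, $\kappa_B$ and $\kappa_A$. It assumes the constants $(9-\sqrt{77})/2$ and $1/9$ as they are read from the proof in Section 2; the function name `cond_lower_bounds` and the sample numbers are hypothetical, and no attempt is made to compute $D_u$ itself.

```python
import math


def cond_lower_bounds(n, D_u, kappa_B, kappa_A):
    """Return (sharper bound, simpler bound) on Cond(X).

    sharper = (9 - sqrt(77)) / (2 sqrt(n) D_u kappa_B kappa_A)
    simpler = 1 / (9 sqrt(n) D_u kappa_B kappa_A)
    """
    zeta = (9.0 - math.sqrt(77.0)) / 2.0
    x = D_u * kappa_B * kappa_A
    sharper = zeta / (math.sqrt(n) * x)
    simpler = 1.0 / (9.0 * math.sqrt(n) * x)
    return sharper, simpler


# Hypothetical data: a 4 by 4 problem close to an uncontrollable pair.
sharper, simpler = cond_lower_bounds(n=4, D_u=1e-3, kappa_B=2.0, kappa_A=5.0)
print(sharper, simpler)
```

The two bounds differ only in the constant, since $(9-\sqrt{77})/2 \approx 0.1125$ is slightly larger than $1/9 \approx 0.1111$. Both are vacuous whenever they fall below 1.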

Let us interpret this result. The distance from $(A,B)$ to the nearest uncontrollable
pair is also the distance of the matrix pencil $[B \mid A-\lambda I]$ to the nearest pencil with a
certain Kronecker canonical form, as we now explain. The pencil $[B \mid A-\lambda I]$ can only
have two types of structure elements in its Kronecker canonical form: $L_t$ blocks

$$ L_t = \begin{bmatrix} \lambda & 1 & & & \\ & \lambda & 1 & & \\ & & \ddots & \ddots & \\ & & & \lambda & 1 \end{bmatrix} $$

of dimension $t$ by $t+1$, and Jordan blocks $J_n(z_i)-\lambda I_n$ with finite eigenvalue $z_i$. (The
presence of $L_t^T$ blocks and infinite eigenvalues would require the coefficient of $\lambda$, $[0 \mid I]$,
to have deficient row rank, which is clearly not the case.) It is known ([Wonham]) that
the pair $(A,B)$ is controllable if and only if the pencil $[B \mid A-\lambda I]$ has no regular part, i.e.
no Jordan blocks. Indeed, for generic matrices $A$ and $B$, the pair $(A,B)$ will be
controllable. The uncontrollable pairs $(A,B)$ lie in a lower dimensional surface in the
space of all matrix pairs. The theorem states that the scaled reciprocal of the distance
(measured in the $D_u$ norm) from a given controllable pair $(A,B)$ to this surface will
essentially determine a lower bound on Cond(X) as defined above. This idea of relating
the conditioning of a problem to the distance from that problem to a lower dimensional
surface of "infinitely" ill-conditioned problems arises elsewhere in numerical analysis
([Kahan], [Demmel]). The standard example of this is the condition number of a matrix
with respect to inversion equaling the (scaled) reciprocal of the distance of that matrix to
the nearest singular matrix (this distance equals the smallest singular value of the matrix).
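The standard example can be checked by hand for a 2 by 2 matrix. The sketch below is an illustration only: it uses the closed form for the singular values of a 2 by 2 matrix, and the helper name `singular_values_2x2` and the sample matrix diag(3, 0.5) are assumptions chosen so that the nearest singular matrix, diag(3, 0), is obvious.

```python
import math


def singular_values_2x2(a, b, c, d):
    """Singular values of [[a, b], [c, d]] from the closed form for 2x2 matrices."""
    s = a * a + b * b + c * c + d * d          # squared Frobenius norm
    det = a * d - b * c
    disc = math.sqrt(max(s * s - 4.0 * det * det, 0.0))
    return math.sqrt((s + disc) / 2.0), math.sqrt(max((s - disc) / 2.0, 0.0))


# M = diag(3, 0.5): the nearest singular matrix in the 2-norm is diag(3, 0).
sigma_max, sigma_min = singular_values_2x2(3.0, 0.0, 0.0, 0.5)
cond = sigma_max / sigma_min                   # condition number for inversion
distance_to_singular = sigma_min               # 2-norm distance to diag(3, 0)

# Cond(M) is the norm of M divided by the distance to the nearest singular matrix.
print(cond, sigma_max / distance_to_singular)

# The perturbed matrix diag(3, 0) is indeed singular: its smallest singular value is 0.
print(singular_values_2x2(3.0, 0.0, 0.0, 0.0))
```

Here the scaling factor is $\|M\| = \sigma_{\max}(M)$, so that Cond(M) $= \|M\| / \sigma_{\min}(M)$; the theorem plays the same game with $D_u$ in place of $\sigma_{\min}$.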

The presence of $\kappa_B$ and $\kappa_A$ in the denominator of the lower bound means that as $B$ or
some $A-\lambda_i I$ becomes more ill-conditioned, the lower bound becomes smaller, and we
cannot guarantee an ill-conditioned solution by this technique. The solution $X$ may still be
ill-conditioned in the sense of being very sensitive to small perturbations in $B$ and $A$, but
this behavior does not seem to be captured by a lower bound on its condition number.

2.    Proof  of  the  Theorem 

We begin by summarizing the solution of the robust pole assignment problem as
described in [Kautsky, Nichols, Van Dooren]: Let $x_i$ denote the eigenvector of $A+BF$
corresponding to $\lambda_i$. Then $x_i$ must satisfy

$$ x_i \in S_i = N\left( U_1^T (A-\lambda_i I) \right) \tag{1} $$

where $N(\cdot)$ denotes the null space of $(\cdot)$ and

$$ B = [U_0 , U_1] \begin{bmatrix} Z \\ 0 \end{bmatrix} \tag{2} $$

with $U = [U_0 , U_1]$ orthogonal and $Z$ nonsingular. Thus, the columns of $U_1$ span the
orthogonal complement of the column space of $B$. If $(A,B)$ is controllable, then
$\dim(S_i) = m$ [Kautsky, Nichols, Van Dooren].
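The dimension count $\dim(S_i) = m$ can be illustrated on a small controllable pair. The sketch below is an assumption-laden example, not the authors' procedure: it takes a hypothetical 3 by 3 shift matrix $A$ with $B = [e_2, e_3]$, so that $U_1 = e_1$ can be written down by inspection, restricts $\lambda_i$ to real rational values, and uses exact arithmetic. The helper names are hypothetical.

```python
from fractions import Fraction


def rank(M):
    """Rank by Gaussian elimination over exact rationals."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][c] / rows[r][c]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def dim_S(A, U1, lam):
    """Dimension of N(U1^T (A - lam I)) for a real rational shift lam."""
    n = len(A)
    shifted = [[Fraction(A[i][j]) - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    # One row of U1^T (A - lam I) per column of U1.
    product = [[sum(U1[k][c] * shifted[k][j] for k in range(n)) for j in range(n)]
               for c in range(len(U1[0]))]
    return n - rank(product)


# Hypothetical controllable pair with n = 3, m = 2: B = [e2, e3], so U1 = e1.
A = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
U1 = [[1], [0], [0]]
print(dim_S(A, U1, 2), dim_S(A, U1, -1))
```

For this pair $U_1^T(A-\lambda I)$ is the single row $[-\lambda, 1, 0]$, which is never zero, so the null space has dimension $2 = m$ for every $\lambda$.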

If $(A,B)$ is not controllable, we will show that the $S_i$ will generally lie in a certain
$n-r$ dimensional subspace of $R^n$, so that the $x_i$ will also lie in this subspace. Therefore
$X$, whose columns are the $x_i$, will be singular with rank at most $n-r$. This is another way
of saying that the robust pole placement problem generally has no solution for $(A,B)$ not
controllable. If $(A,B)$ is close to being not controllable, we will show that the $S_i$ will lie
close to an $n-1$ dimensional subspace, so that $X$ will be nearly singular (ill-conditioned).
(If some $\lambda_i$ happens already to be an eigenvalue of $A$, then $\dim(S_i)$ will be larger than $m$,
$S_i$ will not necessarily lie in the $n-r$ dimensional subspace, and the robust pole placement
problem may have a solution. This problem requires a separate analysis, which we will
not do here.)

Lemma 1: If $(A_u,B_u)$, $B_u$ of full rank, is not controllable then there exist unitary matrices
$P$ ($n$ by $n$) and

$$ Q = \begin{bmatrix} I_m & 0 \\ 0 & P \end{bmatrix} $$

($n+m$ by $n+m$) such that

$$ P^T [B_u \mid A_u-\lambda I] Q = \begin{bmatrix} Z_u & A_{u11}-\lambda I_m & A_{u12} & A_{u13} \\ 0 & A_{u21} & A_{u22}-\lambda I_{n-m-r} & A_{u23} \\ 0 & 0 & 0 & A_{u33}-\lambda I_r \end{bmatrix} . \tag{3} $$

Here the matrices in the first, second, third and fourth columns have $m$, $m$, $n-m-r$ and
$r$ columns respectively, and the matrices in the first, second and third rows have $m$,
$n-m-r$ and $r$ rows, respectively. Furthermore, $Z_u$ is a nonsingular matrix, and $A_{u33}$ is
upper triangular.

Proof: Choose $P = [P_1 \mid P_2 \mid P_3]$ with $P_1$ spanning the column space of $B_u$, $P_2$ spanning
the orthogonal complement of the space spanned by $P_1$ within the controllable subspace
of $(A_u,B_u)$, and $P_3$ spanning the remainder of $R^n$ (the orthogonal complement of the
controllable subspace), such that $A_{u33}$ is in Schur canonical form. The form of (3)
follows as in [Wonham, Chap. 1]. Q.E.D.

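From (3) we see that we can express $S_i$ for the pair $(A_u,B_u)$ as

$$ S_i = P \cdot N\left( \begin{bmatrix} A_{u21} & A_{u22}-\lambda_i I_{n-m-r} & A_{u23} \\ 0 & 0 & A_{u33}-\lambda_i I_r \end{bmatrix} \right) . $$

Since $A_{u33}-\lambda I_r$ is the regular part of the pencil $[B_u \mid A_u-\lambda I]$, we have

Lemma 2: If $\lambda_i$ is not a finite eigenvalue of the pencil $[B_u \mid A_u-\lambda I]$, then

$$ S_i \subset \mathrm{span}[P_1 \mid P_2] . $$

A sufficient but not necessary condition for $\lambda_i$ not being a finite eigenvalue of the pencil
$[B_u \mid A_u-\lambda I]$ is that it not be an eigenvalue of $A_u$.

Proof: If $\lambda_i$ is not an eigenvalue of the pencil, then $A_{u33}-\lambda_i I_r$ is a nonsingular matrix, so
that any vector in the null space of

$$ \begin{bmatrix} A_{u21} & A_{u22}-\lambda_i I_{n-m-r} & A_{u23} \\ 0 & 0 & A_{u33}-\lambda_i I_r \end{bmatrix} $$

must have its last $r$ components 0. Changing basis with $P$ yields the result. Q.E.D.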

Now suppose the pair $(A,B)$ is controllable, and that the closest uncontrollable pair
is $(A_u,B_u)$. Here we measure distance using the unitarily invariant absolute norm

$$ D_u = \frac{\|A_u-A\|_F}{\|A\|_F} + \frac{\|B_u-B\|_F}{\|B\|_F} \tag{4} $$

where $\|\cdot\|_F$ denotes the Frobenius norm. Choose $P$ and $Q$ as in (3) above to put the
pencil $[B_u \mid A_u-\lambda I]$ into the indicated form. The next lemma describes the form of
$P^T[B \mid A-\lambda I]Q$:

Lemma 3:

$$ P^T [B \mid A-\lambda I] Q = \begin{bmatrix} Z_u & A_{u11}-\lambda I_m & A_{u12} & A_{u13} \\ 0 & A_{u21} & A_{u22}-\lambda I_{n-m-r} & A_{u23} \\ B_3 & A_{31} & A_{32} & A_{u33}-\lambda I_r \end{bmatrix} . $$

Furthermore, only the last rows of $B_3$, $A_{31}$ and $A_{32}$ are nonzero. Note that only the three
blocks at the bottom left differ from (3).

Proof: Suppose, to the contrary, that there were differences outside the three blocks $B_3$,
$A_{31}$ and $A_{32}$. This would contradict the definition of $(A_u,B_u)$ as being the closest
uncontrollable pair to $(A,B)$, since by making $P^T[B_u \mid A_u]Q$ agree with $P^T[B \mid A]Q$ outside
these three blocks we could find an uncontrollable pair even closer to $(A,B)$. Similarly, if
$B_3$, $A_{31}$ and $A_{32}$ had nonzero entries elsewhere than in their last rows, we could find an
uncontrollable pair closer to $(A,B)$ than $(A_u,B_u)$ by making $P^T[B_u \mid A_u]Q$ agree with
$P^T[B \mid A]Q$ outside the last rows of $B_3$, $A_{31}$ and $A_{32}$. Q.E.D.

Now we are ready to argue that the $S_i$ for the pair $(A,B)$ all lie close to $\mathrm{span}[P_1 \mid P_2]$
if the distance $D_u$ (see (4)) from $(A,B)$ to the nearest uncontrollable pair $(A_u,B_u)$ is small
enough, and if $\lambda_i$ is far enough from the spectrum of $[B_u \mid A_u-\lambda I]$.

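Let $R = B_3 Z_u^{-1}$, $T = (I+RR^*)^{-1/2}$, and $T_* = (I+R^*R)^{-1/2}$. Note that only the last
row of $R$ is nonzero, and that

$$ \|R\| \le \|B_3\|_F \, \|Z_u^{-1}\| \le \frac{\|B_3\|_F}{\sigma_{\min}(B)-\|B_3\|_F} \le \frac{D_u\|B\|_F}{\sigma_{\min}(B)-D_u\|B\|_F} = \frac{D_u\,\kappa(B)}{1-D_u\,\kappa(B)} , $$

where $\kappa(B) = \|B\|_F / \sigma_{\min}(B)$. Let

$$ P_R = \begin{bmatrix} T_* & 0 & 0 \\ 0 & I_{n-m-r} & 0 \\ 0 & 0 & T \end{bmatrix} \begin{bmatrix} I_m & 0 & R^* \\ 0 & I_{n-m-r} & 0 \\ -R & 0 & I_r \end{bmatrix} . $$

It is easy to verify that $P_R$ is unitary and that $P_R \cdot P^T[B \mid A-\lambda I]Q =$

$$ \begin{bmatrix} T_*(Z_u+R^*B_3) & T_*(A_{u11}-\lambda I_m+R^*A_{31}) & T_*(A_{u12}+R^*A_{32}) & T_*(A_{u13}+R^*A_{u33}-R^*\lambda) \\ 0 & A_{u21} & A_{u22}-\lambda I_{n-m-r} & A_{u23} \\ 0 & T(A_{31}-RA_{u11})+TR\lambda & T(A_{32}-RA_{u12}) & T(A_{u33}-RA_{u13})-T\lambda \end{bmatrix} $$

so that

$$ S_i = P \cdot N\left( \begin{bmatrix} A_{u21} & A_{u22}-\lambda_i I_{n-m-r} & A_{u23} \\ T(A_{31}-RA_{u11}+R\lambda_i) & T(A_{32}-RA_{u12}) & T(A_{u33}-RA_{u13}-\lambda_i I_r) \end{bmatrix} \right) . $$

The unitary factor $P$ does not change any angles between vectors so we may assume it
equals $I$ now for simplicity. Thus, $S_i$ is perpendicular to the last row of

$$ [\, A_{31}-RA_{u11}+R\lambda_i \; , \; A_{32}-RA_{u12} \; , \; A_{u33}-RA_{u13}-\lambda_i I_r \,] . \tag{5} $$

The next lemma shows that if $\lambda_i$ is not too close to an eigenvalue of $[B_u \mid A_u-\lambda I]$, then this
row vector must lie within a small angle of $[0 \mid 0 \mid 1]$: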

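Lemma 4: Suppose $\lambda_i$ is not close to an eigenvalue of $[B_u \mid A_u-\lambda I]$ in the sense that

$$ |A_{u33,rr}-\lambda_i| \ge c\,\|A\|_F , \tag{6} $$

where $c>0$ and $A_{u33,rr}$ denotes the last diagonal entry of $A_{u33}$. Suppose $D_u\,\kappa(B) < 1$ and
$D_u\,\kappa(B) / (1-D_u\,\kappa(B)) < c$. Then the tangent of the angle $\theta$ between the last row of (5)
and $[0 \mid 0 \mid 1]$ is bounded above by

$$ 6 \cdot \frac{\dfrac{D_u\,\kappa(B)}{1-D_u\,\kappa(B)}}{c-\dfrac{D_u\,\kappa(B)}{1-D_u\,\kappa(B)}} . $$

Proof: $\tan(\theta)$ equals the root-sum-of-squares of the first $n-1$ components of the last row
of (5) divided by the absolute value of the last component. It is easy to see that the root-
sum-of-squares of the first $n-1$ components, $RMS_1$, is bounded above by

$$ RMS_1 \le 4\rho\,\|A\|_F + \rho\,|\lambda_i| , \tag{7} $$

where $\rho = D_u\,\kappa(B) / (1-D_u\,\kappa(B))$, and that the absolute value of the last component,
$RMS_2$, is bounded below by

$$ RMS_2 \ge |A_{u33,rr}-\lambda_i| - \rho\,\|A\|_F . \tag{8} $$

We consider two cases, $|\lambda_i| \le (1+c)\|A\|_F$ and $|\lambda_i| \ge (1+c)\|A\|_F$. When
$|\lambda_i| \le (1+c)\|A\|_F$, then from (6), (7) and (8) we have

$$ \frac{RMS_1}{RMS_2} \le \frac{\rho\,(5+c)\,\|A\|_F}{(c-\rho)\,\|A\|_F} = \frac{\rho\,(5+c)}{c-\rho} . $$

When $|\lambda_i| \ge (1+c)\|A\|_F$, (7) and (8) yield

$$ \frac{RMS_1}{RMS_2} \le \frac{4\rho\,\|A\|_F}{(c-\rho)\,\|A\|_F} + \frac{\rho\,|\lambda_i|}{|\lambda_i| \left( 1-\frac{1}{1+c}-\frac{\rho}{1+c} \right)} = \frac{4\rho}{c-\rho} + \frac{\rho\,(1+c)}{c-\rho} = \frac{\rho\,(5+c)}{c-\rho} . $$

Now bound the $5+c$ in each numerator by 6. Q.E.D.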

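As defined above $c$ is difficult to estimate from the initial data $A$, $B$ and $\lambda_i$. The
next lemma shows how to get a simple bound on $c$:

Lemma 5: Let

$$ \kappa_A = \max\left( 1 , \; \frac{\|A\|_F}{\sigma_{\min}(A-\lambda_i I)} \right) . $$

Then

$$ c \ge \frac{1}{\kappa_A} - D_u . $$

Proof: The 2-norm of any row of $V(A-\lambda_i I)V^T$ ($V$ unitary) is at least $\sigma_{\min}(A-\lambda_i I)$. In
particular, the last row of $[A_{31}, A_{32}, A_{u33}-\lambda_i I_r]$ has such a norm. By the triangle inequality,
the sum of the 2-norm of the last row of $[A_{31}, A_{32}]$ and $|A_{u33,rr}-\lambda_i|$ is at least $\sigma_{\min}(A-\lambda_i I)$.
Since the 2-norm of the last row of $[A_{31}, A_{32}]$ is bounded above by $D_u\|A\|_F$ we have

$$ |A_{u33,rr}-\lambda_i| \ge \sigma_{\min}(A-\lambda_i I) - D_u\|A\|_F $$

or

$$ \frac{|A_{u33,rr}-\lambda_i|}{\|A\|_F} \ge \frac{\sigma_{\min}(A-\lambda_i I)}{\|A\|_F} - D_u \ge \frac{1}{\kappa_A} - D_u . $$

Q.E.D.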

We now need to translate our upper bound on $\tan(\theta)$ into a lower bound on
Cond(X):

Lemma 6: Suppose the $n$ dimensional vector $x_i \in S_i$, where the space $S_i$ is perpendicular to
the unit vector $y_i$. Suppose further that the angle $\theta_i$ between each $y_i$ and a fixed unit
vector $e$ is bounded above by $\theta_{\max}$. Let $X$ be an $n$ by $n$ matrix whose columns are the $x_i$.
Then

$$ \mathrm{Cond}(X) \ge \frac{1}{\sqrt{n}\,\tan(\theta_{\max})} . $$

Proof: Assume without loss of generality that $e = e_n$, the unit vector with a 1 in the $n$-th
entry and zeroes elsewhere. Let $\theta(x,y)$ denote the (acute) angle between the vectors $x$
and $y$. By the triangle inequality we have

$$ \theta(x_i,e_n) + \theta(e_n,y_i) \ge \theta(x_i,y_i) = \pi/2 $$

so that

$$ \theta(x_i,e_n) \ge \pi/2 - \theta(y_i,e_n) = \pi/2 - \theta_i \ge \pi/2 - \theta_{\max} . $$

Writing $x_i = [\eta_i^T , \epsilon_i]^T$, $\eta_i$ an $n-1$ vector, we see

$$ \frac{|\epsilon_i|}{\|\eta_i\|} = \cot(\theta(x_i,e_n)) \le \cot(\pi/2 - \theta_{\max}) = \tan(\theta_{\max}) $$

so

$$ |\epsilon_i| \le \|\eta_i\| \tan(\theta_{\max}) \le \|X\| \tan(\theta_{\max}) . $$

Now factor $X = LQ$, $L$ lower triangular and $Q$ unitary, so that $L$ and $X$ have the same
singular values and rows of the same norm. Thus the last row of $L$ has the same norm as
$[\epsilon_1 , \ldots , \epsilon_n]$, which is in turn bounded by $\sqrt{n}\,\|X\| \tan(\theta_{\max})$. Since the norm of any
row of $L$ is an upper bound on its smallest singular value, we have

$$ \mathrm{Cond}(X) = \mathrm{Cond}(L) \ge \frac{\|X\|}{\sqrt{n}\,\|X\| \tan(\theta_{\max})} = \frac{1}{\sqrt{n}\,\tan(\theta_{\max})} , $$

as desired. Q.E.D.
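Lemma 6 can be checked numerically in the smallest case $n = 2$. The sketch below is an illustration under stated assumptions, not part of the proof: it takes $e = e_2$, picks unit vectors $y_i = (\sin t_i, \cos t_i)$ at signed angles $t_i$ from $e_2$, uses $x_i = (\cos t_i, -\sin t_i)$ as the perpendicular directions, and computes Cond(X) from the closed form for 2 by 2 singular values. The function names are hypothetical, and the two angles must differ so that $X$ is nonsingular.

```python
import math


def cond_2x2(a, b, c, d):
    """2-norm condition number of the nonsingular matrix [[a, b], [c, d]]."""
    s = a * a + b * b + c * c + d * d
    det = a * d - b * c
    disc = math.sqrt(max(s * s - 4.0 * det * det, 0.0))
    return math.sqrt((s + disc) / (s - disc))


def lemma6_check(t1, t2):
    """Return (Cond(X), 1 / (sqrt(2) tan(theta_max))) for angles t1 != t2."""
    # Column x_i = (cos t_i, -sin t_i) is orthogonal to y_i = (sin t_i, cos t_i),
    # and y_i makes the angle |t_i| with e_2 = (0, 1).
    X = (math.cos(t1), math.cos(t2), -math.sin(t1), -math.sin(t2))
    theta_max = max(abs(t1), abs(t2))
    bound = 1.0 / (math.sqrt(2.0) * math.tan(theta_max))
    return cond_2x2(*X), bound


for angles in [(0.1, -0.1), (0.05, 0.2)]:
    cond, bound = lemma6_check(*angles)
    print(angles, cond, bound, cond >= bound)
```

In the symmetric case $t_1 = -t_2 = t$ the rows of $X$ are orthogonal and Cond(X) $= \cot(t)$, which exceeds the bound $1/(\sqrt{2}\tan t)$ by the factor $\sqrt{2}$.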

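Combining these lemmas yields the theorem.

Theorem: Let

$$ \kappa_B = \frac{\|B\|_F}{\sigma_{\min}(B)} , $$

$$ \kappa_A = \max\left( 1 , \; \max_i \frac{\|A\|_F}{\sigma_{\min}(A-\lambda_i I)} \right) , $$

and

$$ D_u = \frac{\|A_u-A\|_F}{\|A\|_F} + \frac{\|B_u-B\|_F}{\|B\|_F} , $$

where $(A_u,B_u)$ is the uncontrollable pair which minimizes $D_u$. Then any solution $X$ to the
robust pole assignment problem satisfies

$$ \mathrm{Cond}(X) \ge \frac{9-\sqrt{77}}{2\sqrt{n}\, D_u \kappa_B \kappa_A} > \frac{1}{9\sqrt{n}\, D_u \kappa_B \kappa_A} . $$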

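Proof: Let $\zeta = (9-\sqrt{77})/2$. We consider two cases, $D_u \kappa_B \kappa_A \ge \zeta$ and $D_u \kappa_B \kappa_A < \zeta$.
In the first case,

$$ \frac{9-\sqrt{77}}{2\sqrt{n}\, D_u \kappa_B \kappa_A} \le \frac{1}{\sqrt{n}} \le 1 , $$

so the theorem is trivially true. In the second case we have from Lemmas 4, 5 and 6, with
$\rho = D_u \kappa_B / (1-D_u \kappa_B)$, that

$$ \mathrm{Cond}(X) \ge \frac{1}{\sqrt{n}\,\tan(\theta_{\max})} \ge \frac{(1-D_u\kappa_B)\,(c-\rho)}{6\sqrt{n}\, D_u \kappa_B} = \frac{(1-D_u\kappa_B)\,c - D_u\kappa_B}{6\sqrt{n}\, D_u \kappa_B} $$

$$ \ge \frac{1}{6\sqrt{n}\, D_u \kappa_B} \left[ (1-D_u\kappa_B) \left( \frac{1}{\kappa_A} - D_u \right) - D_u\kappa_B \right] = \frac{1}{6\sqrt{n}\, D_u \kappa_B \kappa_A} \left[ (1-D_u\kappa_B)(1-D_u\kappa_A) - D_u\kappa_B\kappa_A \right] $$

$$ \ge \frac{1}{6\sqrt{n}\, D_u \kappa_B \kappa_A} \left[ (1-D_u\kappa_B\kappa_A)^2 - D_u\kappa_B\kappa_A \right] \ge \frac{1}{6\sqrt{n}\, D_u \kappa_B \kappa_A} \left[ (1-\zeta)^2 - \zeta \right] = \frac{9-\sqrt{77}}{2\sqrt{n}\, D_u \kappa_B \kappa_A} , $$

as desired. Here we used $\kappa_A \ge 1$ and $\kappa_B \ge 1$, so that $D_u\kappa_B$ and $D_u\kappa_A$ are both at most
$D_u\kappa_B\kappa_A < \zeta$, and the identity $\zeta^2 - 9\zeta + 1 = 0$, which gives $(1-\zeta)^2 - \zeta = 6\zeta$. Q.E.D.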